Introduction



This paper provides basic I/O performance and scalability information for the A6826A dual-channel 
2Gb Fibre Channel adapter on HP's Itanium-based Integrity servers. A series of tests was conducted 
to evaluate the adapter's performance on rx2600, rx4640, rx7620, rx8620 and HP Integrity Superdome 
servers. The tests measured the IOPS and throughput of single and multiple adapters. 

This paper addresses the performance capabilities of the A6826A adapter when used with the 
above-mentioned HP Integrity servers. It also discusses the system setup considerations necessary to 
obtain the maximum possible performance from A6826A adapters installed in these servers. 

This paper focuses on the following topics: 

• Test results: Single and multiple adapter performance data, including IOPS and throughput for 
a single port and dual ports, will be discussed. 

• Scalability: Multiple adapter scalability tests on rx2600, rx4640, rx7620, rx8620 and Integrity 
Superdome systems will be discussed. 

• System configuration guidelines: rx2600, rx4640, rx7620, rx8620 and Integrity Superdome system 
configurations and recommendations will be discussed. 

• Test details: The HP products used, the test setup, the benchmark tool used, and the system 
configuration in the test setup, will be discussed. 



Executive summary 

A6826A adapters installed in HP Itanium-based Integrity servers offer an excellent SAN solution. The 
A6826A adapter provides read throughput of 195MB/s and write throughput of 186MB/s on a single 
port running at 2Gb/s. With both ports running at 2Gb/s, read throughput of 390MB/s and write 
throughput of 372MB/s are achieved. 



A6826A performance summary 
NOTE: 

*bd: Bidirectional Operation 

The above performance data was obtained by installing an A6826A adapter in a PCI-X 133MHz slot 
(Slot 8) of an rx4640 server. 



An A6826A adapter installed in a PCI-X slot of an rx4640 server offers excellent bidirectional 
performance. A single A6826A port running at 2Gb/s provides bidirectional throughput of 370MB/s 
and both A6826A ports running at 2Gb/s provide bidirectional throughput of 720MB/s with linear 
scaling. 

On other Integrity servers the A6826A adapter provides outstanding performance with linear scaling 
for up to 4 adapters when all eight ports are being used at 2Gb/s bandwidth. Additional adapters 
may be added to provide greater connectivity. 
Test results 



HP obtained the performance data presented in this paper using the diskbench (db) benchmarking 
utility. Various block size tests were executed for read and write on single and dual ports. The IOPS 
metric was obtained with 1 KB block size transfers. The throughput metric was obtained with 128 KB 
block size transfers. The throughput metric is useful in modeling large sequential transfers, for example, 
remote backup. The IOPS metric is useful in modeling small transactional traffic. 

IOPS 



Chart 1a: IOPS 
Chart 1a shows the number of I/O operations per second for read and write operations on a single 
port and on both ports of an A6826A adapter measured on a 4-way rx4640 server. The X-axis 
represents the number of Fibre Channel ports and the Y-axis represents the number of I/O operations 
per second. 

Chart 1a shows linear scaling of IOPS for one and two ports. The number of read I/O operations per 
second for a single port and for dual ports is 35180 and 70032 respectively. The number of write 
operations per second for a single port and for dual ports is 20619 and 41221 respectively. The IOPS 
metric is limited by the processor used on the A6826A adapter. 
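The scaling claim and the data rate implied by these IOPS figures can be checked with a few lines of arithmetic. The following is only an illustrative sketch: the function names are hypothetical, and the conversion assumes 1 MB = 1024 KB, which this paper does not state.

```python
# IOPS figures reported for Chart 1a (rx4640, 1 KB transfers).
IOPS = {
    "read": {1: 35180, 2: 70032},
    "write": {1: 20619, 2: 41221},
}
BLOCK_KB = 1  # block size used for the IOPS tests


def scaling_factor(one_port: float, two_ports: float) -> float:
    """Ratio of the dual-port result to the single-port result (2.0 is perfectly linear)."""
    return two_ports / one_port


def iops_to_mb_per_s(iops: float, block_kb: float) -> float:
    """Data rate implied by an IOPS figure, assuming 1 MB = 1024 KB."""
    return iops * block_kb / 1024


for op, ports in IOPS.items():
    factor = scaling_factor(ports[1], ports[2])
    rate = iops_to_mb_per_s(ports[2], BLOCK_KB)
    print(f"{op}: scaling {factor:.3f}x, dual-port data rate {rate:.1f} MB/s")
```

At 1 KB per operation the implied data rate is far below the 2Gb/s link rate, which is consistent with the statement that the IOPS metric is limited by the adapter's processor rather than by link bandwidth.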

Chart 1b shows the % CPU utilization for I/O operation tests. The X-axis represents the number of Fibre 
Channel ports. The Y-axis represents the % CPU utilization. To accurately record CPU utilization, only 
one CPU was configured in these tests. Chart 1b demonstrates excellent CPU utilization numbers for 
small size I/Os on both ports of an A6826A adapter. 

Service demand 

Service Demand is a measure of the CPU time needed to handle 1 KB of I/O. The Service Demand 
number for an adapter indicates how much load the adapter places on the CPU(s) for each 
KB/s of throughput. The A6826A adapter offers low CPU service demand for small I/Os. To illustrate the 
service demand for the A6826A adapter, small size operation tests were conducted using db. 
Sequential Read and Write operations of 4KB and 8KB I/O sizes were performed on an rx4640 server 
with one CPU configured. 

The following table shows single and dual port throughput and CPU utilization for raw Sequential Read 
and Sequential Write operations with 4KB and 8KB I/O sizes: 
NOTE: 

1 Throughput in MB/s 

2 Single CPU utilization numbers 

3 Service Demand in microseconds, calculated as ((% CPU utilization / 100) / throughput in KB/s) x 10^6 

Data gathered on an rx4640 server configured with a single CPU 
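The Service Demand formula in note 3 can be written as a small function. This is a minimal sketch: the function name is hypothetical, the MB-to-KB conversion assumes 1 MB = 1024 KB, and the input values in the example are made up for illustration and are not measurements from this paper.

```python
def service_demand_us_per_kb(cpu_util_percent: float, throughput_mb_s: float) -> float:
    """CPU microseconds consumed per KB transferred.

    Implements ((% CPU utilization / 100) / throughput in KB/s) x 10^6.
    Throughput is taken in MB/s and converted with 1 MB = 1024 KB (assumption).
    """
    if throughput_mb_s <= 0:
        raise ValueError("throughput must be positive")
    throughput_kb_s = throughput_mb_s * 1024
    return (cpu_util_percent / 100) / throughput_kb_s * 1e6


# Hypothetical inputs, for illustration only: 20% of one CPU at 100 MB/s.
demand = service_demand_us_per_kb(cpu_util_percent=20.0, throughput_mb_s=100.0)
print(f"Service demand: {demand:.3f} microseconds per KB")
```

A lower value means less CPU time is spent per KB moved, so the metric allows adapters to be compared independently of the absolute throughput each one reached.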



Throughput 

Chart 2a: Single and Dual Port Throughput 



Chart 2b: CPU Utilization for Throughput Tests 
Data gathered on an rx4640 server configured with a single CPU 



Chart 2a shows the 128KB raw sequential read and write throughput of single and dual ports on an 
A6826A adapter. The X-axis represents the number of Fibre Channel ports. The Y-axis represents the 
throughput in MB/s. 

Chart 2a shows one A6826A port performing read operations at 195MB/s. Two A6826A ports scale to 
2x the performance of a single A6826A port, performing read operations at up to 390MB/s. 

For write operations, one A6826A port performs at 186MB/s. Two A6826A ports scale to 2x the 
performance of a single A6826A port, performing writes at up to 372MB/s. 

Chart 2a demonstrates outstanding read and write performance with linear scaling for 1 and 2 ports 
on the A6826A adapter. 
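To put the Chart 2a figures in context, the sketch below compares them with the nominal payload rate of a 2Gb/s Fibre Channel link. The 200 MB/s per direction value is an assumption brought in for illustration (it is the commonly quoted nominal rate for 2Gb/s Fibre Channel and is not stated in this paper), and the function name is hypothetical.

```python
# Assumption: nominal 2Gb/s Fibre Channel payload rate per direction, in MB/s.
NOMINAL_LINK_MB_S = 200.0

# Throughput reported for Chart 2a (128KB sequential transfers), keyed by port count.
THROUGHPUT_MB_S = {
    "read": {1: 195, 2: 390},
    "write": {1: 186, 2: 372},
}


def link_utilization(throughput_mb_s: float, ports: int,
                     nominal_mb_s: float = NOMINAL_LINK_MB_S) -> float:
    """Fraction of the combined nominal link rate used by the measured throughput."""
    return throughput_mb_s / (ports * nominal_mb_s)


for op, by_ports in THROUGHPUT_MB_S.items():
    for ports, rate in by_ports.items():
        share = link_utilization(rate, ports)
        print(f"{op}, {ports} port(s): {rate} MB/s, {share:.1%} of nominal link rate")
```

Under that assumption, the utilization is the same for one and two ports in each direction, which is another way of stating that the second port adds its full share of throughput.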



Chart 2b shows the % CPU utilization in throughput tests. The X-axis represents the number of Fibre 
Channel ports. The Y-axis represents the % CPU utilization. To accurately record the CPU utilization, only 
one CPU was configured in these tests. Chart 2b shows very small percentages of CPU utilization for one 
or two A6826A ports in read and write throughput tests. 
Scalability 



A series of tests was conducted to evaluate the performance scalability of the A6826A adapter on 
rx2600, rx4640, rx7620, rx8620 and HP Integrity Superdome servers. 



Chart 3a: 128KB Sequential Read Throughput 

X-axis: Number of Ports 

Systems shown in the chart legend: 

• rx2600 with two 900MHz Itanium 2 processors and 8GB physical memory 
• rx4640 with four 1.3GHz Itanium 2 processors and 8GB physical memory 
• rx7620, 2-cell partition, each cell with four 1.3GHz Itanium 2 processors and 8GB physical memory 
• rx8620, 2-cell partition, each cell with four 1.3GHz Itanium 2 processors and 8GB physical memory 
• Integrity Superdome, 2-cell partition with eight 1.5GHz Itanium 2 processors and 8GB physical memory 



Chart 3a shows the read throughput performance of 1, 2, 3 and 4 A6826A adapters installed in rx2600, 
rx4640, rx7620, rx8620, and HP Integrity Superdome servers. The X-axis represents the number of ports. 
The Y-axis represents the throughput performance in MB/s. 

Chart 3a shows excellent read throughput performance with linear scaling for 4 A6826A adapters installed in 
rx2600, rx4640, rx7620, rx8620 and HP Integrity Superdome servers, reaching an aggregate throughput 
of 1523MB/s. A6826A adapters installed in HP Integrity servers offer an excellent scalable SAN solution. 
Chart 3b: 128KB Sequential Write Throughput 

X-axis: Number of Ports 

Systems shown in the chart legend: 

• rx2600 with two 900MHz Itanium 2 processors and 8GB physical memory 
• rx4640 with four 1.3GHz Itanium 2 processors and 8GB physical memory 
• rx7620, 2-cell partition, each cell with four 1.3GHz Itanium 2 processors and 8GB physical memory 
• rx8620, 2-cell partition, each cell with four 1.3GHz Itanium 2 processors and 8GB physical memory 
• Integrity Superdome, 2-cell partition with eight 1.5GHz Itanium 2 processors and 8GB physical memory 



Chart 3b shows the write throughput performance of 1, 2, 3 and 4 A6826A adapters installed in rx2600, 
rx4640, rx7620, rx8620 and HP Integrity Superdome servers. The X-axis represents the number of ports. 
The Y-axis represents the throughput performance in MB/s. 

Chart 3b shows excellent write throughput performance with linear scaling for 4 A6826A adapters 
installed in rx2600, rx4640, rx7620, rx8620 and HP Integrity Superdome servers, reaching an aggregate 
throughput of 1496 MB/s. 

The table below summarizes the maximum number of cards that can be installed in the tested servers 
while maintaining linear scalability. Additional adapters may be installed to provide better 
connectivity. 
System configuration guidelines 



NOTE: Some PCI-X 133MHz slots on Integrity servers have higher bandwidth than other PCI-X 133MHz 
slots on the same server. In this whitepaper, we refer to these higher bandwidth PCI-X 133MHz slots as 
high performance PCI-X 133MHz slots. 

rx2600 

The rx2600 is a 2-way entry-level Integrity server with 4 PCI-X 133MHz slots. A6826A adapters installed in 
an rx2600 server offer a great scalable solution. 

rx4640 

The rx4640 is a 4-way entry-level Integrity server with eight I/O slots. Slots 1 and 2 are reserved for core 
SCSI and core LAN. Slots 3, 4, 5, and 6 are shared slots operating at 3.3V, 64-bit PCI-66MHz. Slots 7 and 8 
are high performance PCI-X 133MHz slots. To achieve linear scalability with up to four A6826A 
adapters, HP recommends that the first two A6826As be installed in slot 7 and slot 8. The next two A6826As 
should be installed in the shared slots, avoiding adjacent shared slots. 
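The rx4640 placement guidance above can be expressed as a simple check. This is a hypothetical helper written for illustration, not an HP tool. It assumes that "adjacent" shared slots are those with consecutive slot numbers, which this paper does not spell out.

```python
from typing import List

HIGH_PERF_SLOTS = {7, 8}      # PCI-X 133MHz high performance slots
SHARED_SLOTS = {3, 4, 5, 6}   # shared 64-bit PCI-66MHz slots
RESERVED_SLOTS = {1, 2}       # core SCSI and core LAN


def check_rx4640_placement(slots: List[int]) -> List[str]:
    """Return the problems found in a list of adapter slots (empty if none)."""
    problems = []
    used = set(slots)
    if len(used) != len(slots):
        problems.append("a slot is listed more than once")
    if len(used) > 4:
        problems.append("linear scalability was shown for up to four adapters")
    for slot in sorted(used & RESERVED_SLOTS):
        problems.append(f"slot {slot} is reserved for core SCSI/LAN")
    for slot in sorted(used - HIGH_PERF_SLOTS - SHARED_SLOTS - RESERVED_SLOTS):
        problems.append(f"slot {slot} is not an rx4640 I/O slot")
    shared_used = sorted(used & SHARED_SLOTS)
    if shared_used and not HIGH_PERF_SLOTS <= used:
        problems.append("fill slots 7 and 8 before using shared slots")
    # Assumption: consecutive slot numbers are physically adjacent.
    for low, high in zip(shared_used, shared_used[1:]):
        if high - low == 1:
            problems.append(f"adapters in adjacent shared slots {low} and {high}")
    return problems


print(check_rx4640_placement([7, 8, 3, 5]))  # follows the guidance
print(check_rx4640_placement([7, 8, 3, 4]))  # adjacent shared slots
```

Returning a list of messages rather than a single boolean makes it easier to see which part of the guidance a given layout violates.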



rx7620 / rx8620 

The rx7620 and rx8620 are cell-based mid-range Integrity servers. The I/O subsystem in the rx7620 and 
rx8620 offers 16 PCI-X 133MHz slots when these servers are configured with 2 cells. Fourteen of the 
sixteen slots are high performance PCI-X 133MHz slots. HP recommends that A6826A adapters be installed 
in high performance PCI-X 133MHz slots (physical slot numbers 1, 2, 3, 4, 5, 6, 7) to achieve the 
performance demonstrated in this paper. To obtain linear scalability, HP recommends that the rx7620 or 
rx8620 be configured with a 2-cell partition, with two A6826As installed in any of the PCI-X slots 
numbered 1 to 7 and a maximum of two A6826As installed in any of the PCI-X slots 
numbered 9 to 15. Each cell should be configured with a multiple of 8 equal capacity DIMMs to take 
advantage of memory interleaving. The DIMMs on a cell should be evenly distributed across the 
busses. 



HP Integrity Superdome 

The HP Integrity Superdome is a cell-based high-end Integrity server. Each Superdome I/O cage 
subsystem offers 4 high performance PCI-X slots capable of delivering 1066MB/s peak bandwidth. HP 
recommends that A6826A adapters be installed in high performance PCI-X slots (physical slot numbers 4, 5, 
6, 7) to achieve the performance demonstrated in this paper. To obtain the linear scalability 
demonstrated in this paper, an Integrity Superdome partition should be configured with at least two 
cells with 8GB of physical memory, and not more than two A6826As should be installed in any of the 
four high performance slots in each I/O cage. Each cell should be configured with a multiple of 8 
equal capacity DIMMs to take advantage of memory interleaving. The DIMMs on a cell should be 
evenly distributed across the busses. 
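The 1066MB/s peak figure is consistent with the usual peak-rate arithmetic for a 64-bit bus clocked at 133MHz. The sketch below reproduces that arithmetic and compares it with the dual-port figures quoted earlier in this paper. It assumes the nominal clock is 133.33MHz, and it uses adapter figures measured on an rx4640, so the comparison is indicative only.

```python
def peak_bus_mb_s(clock_mhz: float, width_bits: int) -> float:
    """Peak rate of a parallel bus that moves width_bits on every clock."""
    return clock_mhz * width_bits / 8


# Assumption: "133MHz" is the nominal 133.33MHz PCI-X clock; the slot is 64 bits wide.
PCIX_133_PEAK = peak_bus_mb_s(400 / 3, 64)

# Dual-port A6826A figures quoted earlier in this paper (measured on an rx4640).
DUAL_PORT_MB_S = {"read": 390, "write": 372, "bidirectional": 720}

print(f"PCI-X 133MHz 64-bit peak: {int(PCIX_133_PEAK)} MB/s")
for name, rate in DUAL_PORT_MB_S.items():
    share = rate / PCIX_133_PEAK
    print(f"dual-port {name}: {rate} MB/s, {share:.0%} of slot peak")
```

Peak bus bandwidth is a theoretical ceiling, so sustained throughput on a real slot is lower. The comparison only shows that a single dual-port adapter fits within one high performance slot's peak rate.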
Test details 



The performance results presented in this paper were obtained with A6826A adapters installed in 
various HP Integrity servers. The system configurations used in the test setup are detailed in the 
following table: 



Products used in testing 

Table: Products used for performance measurement tests 

Servers tested 

rx2600 
• Configured with 
  > Two 900MHz Intel Itanium 2 processors 
  > 4GB system memory 
• HP-UX 11i v2 OS 
• A6826A Fibre Channel driver (Ver. B.11.23.01) 

rx4640 
• Configured with 
  > Four 1.3GHz Intel Itanium 2 processors 
  > 8GB system memory 
• HP-UX 11i v2 OS 
• A6826A Fibre Channel driver (Ver. B.11.23.01) 

rx7620 
• 2-cell partition, each cell with 
  > Four 1.3GHz Intel Itanium 2 processors 
  > 8GB system memory 
• HP-UX 11i v2 OS 
• A6826A Fibre Channel driver (Ver. B.11.23.01) 

rx8620 
• 2-cell partition, each cell with 
  > Four 1.3GHz Intel Itanium 2 processors 
  > 8GB system memory 
• HP-UX 11i v2 OS 
• A6826A Fibre Channel driver (Ver. B.11.23.01) 

Integrity Superdome 
• 2-cell partition, each cell with 
  > Four 1.5GHz Intel Itanium 2 processors 
  > 8GB system memory 
• HP-UX 11i v2 OS 
• A6826A Fibre Channel driver (Ver. B.11.23.01) 

A6826A HBA 
• PCI-X dual-channel 2Gb Fibre Channel adapter 
• Each port capable of independently operating at 1 or 2 Gb/s (Auto-Negotiation) 
• 33/66/100/133MHz, 64-bit capable 

Benchmark software 

db 
• Diskbench (db) is the benchmark suite that generated disk read and write traffic for these tests. 

HP StorageWorks 2Gb/s Disk System 

DS2405 
Test configuration 



The rx4640 test configuration consisted of a 4-way rx4640 with four 1.3GHz processors, 8GB of system 
memory and 4 A6826A adapters installed. Two A6826A adapters were connected to one HP 2Gb/s Fibre 
Channel switch and the other two A6826A adapters were connected to the other HP 2Gb/s Fibre 
Channel switch. Sixteen HP DS2405s were evenly connected to the 2 switches. The switches were zoned 
so each zone consisted of a single A6826A port and 2 DS2405s. A single CPU was configured on the 
rx4640 to obtain the CPU utilization demonstrated in the IOPS and throughput tests for single and dual 
A6826A ports. Additional CPUs were added to study the scalability of the A6826A on the rx4640. 

A similar test setup was used to collect performance data on the rx2600, rx7620, rx8620 and Integrity 
Superdome. 
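The zoning layout described for the rx4640 setup can be sketched as a data structure. This is an illustration of the described topology only: the function name, the zero-based numbering of adapters, ports, switches and disk systems, and the order in which disk systems are assigned to zones are all assumptions.

```python
def build_zones(adapters: int = 4, ports_per_adapter: int = 2,
                disk_systems: int = 16, switches: int = 2) -> list:
    """Build one zone per adapter port, each with an equal share of disk systems."""
    ports = [(a, p) for a in range(adapters) for p in range(ports_per_adapter)]
    disks_per_zone = disk_systems // len(ports)
    zones = []
    for index, (adapter, port) in enumerate(ports):
        # Half of the adapters attach to each switch (adapters 0-1 and 2-3 by default).
        switch = adapter * switches // adapters
        first = index * disks_per_zone
        zones.append({
            "switch": switch,
            "adapter": adapter,
            "port": port,
            "ds2405": list(range(first, first + disks_per_zone)),
        })
    return zones


for zone in build_zones():
    print(zone)
```

With the default arguments this yields eight zones, each containing one A6826A port and two DS2405s, with eight DS2405s behind each switch, matching the counts given in the description above.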

Test setup (diagram; labeled component: HP 2G FC Switch) 
Additional information 



For more information about the A6826A adapter please visit: 
http://www.hp.com/products1/serverconnectivity/storagesnf2/index.html 



For more information about HP's Itanium-based Integrity servers please visit: 
http://www.hp.com/products1/servers/integrity/index.html 